
GPT-OSS prompt caching fix#297

Open
dmitryryabkov wants to merge 2 commits into lmstudio-ai:main from dmitryryabkov:fix/prompt-caching-clean

Conversation

@dmitryryabkov

Fixes lmstudio-ai/lmstudio-bug-tracker#1697

Fixed prompt caching for GPT-OSS 20B MLX models.

Two fixes:

  1. cache_wrapper.py: Added a fallback for cache layers that don't expose an `offset` attribute
  2. batched_model_kit.py: Separated cross-prompt cache key from live cache key

The main issue was that batched models (like GPT-OSS) were tracking generated tokens in the cross-prompt cache key, preventing cache hits for new prompts with overlapping content.
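A minimal sketch of the `offset` fallback from fix 1. This is illustrative only, not the PR's actual diff: every name except the `offset` attribute (the `keys` attribute, the tensor layout) is an assumption about how an MLX KV-cache layer might be shaped.

```python
# Hypothetical sketch of fix 1: read a cache layer's token position,
# falling back gracefully when the layer lacks an `offset` attribute.
# `layer` stands in for an MLX KV-cache entry; attribute names other
# than `offset` are assumptions for illustration.

def cache_layer_offset(layer) -> int:
    """Return the number of tokens already held by a cache layer."""
    # Fast path: standard MLX KVCache layers expose `offset` directly.
    offset = getattr(layer, "offset", None)
    if offset is not None:
        return offset
    # Fallback: derive the position from the cached key tensor's
    # sequence dimension, if present (assumed attribute and layout).
    keys = getattr(layer, "keys", None)
    if keys is not None:
        return keys.shape[2]  # assumed (batch, heads, seq_len, head_dim)
    # Empty or uninitialized layer: nothing cached yet.
    return 0
```

Without such a fallback, cache types that omit `offset` (as some batched GPT-OSS layers apparently do) would raise an `AttributeError` and defeat prompt caching entirely.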

Testing:

  • Unit test added in tests/test_cache_wrapper.py
  • Verified locally with GPT-OSS 20B model in LM Studio (replaced the contents of ~/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@20/lib/python3.11/site-packages/mlx_engine/ with the updated file)
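The key separation in fix 2 can be sketched as follows. Again a hypothetical illustration of the described behavior, not the PR's code: the function and variable names are invented for the example.

```python
# Hypothetical sketch of fix 2: the cross-prompt cache is keyed by the
# prompt tokens only, while the live (in-flight) cache key also tracks
# generated tokens. Names are illustrative.

def cross_prompt_cache_key(prompt_tokens: list[int]) -> tuple[int, ...]:
    """Key for reuse across requests: depends only on the prompt."""
    return tuple(prompt_tokens)

def live_cache_key(prompt_tokens: list[int],
                   generated_tokens: list[int]) -> tuple[int, ...]:
    """Key for the current generation: prompt plus generated tokens."""
    return tuple(prompt_tokens) + tuple(generated_tokens)

prompt = [1, 2, 3]
generated = [4, 5]

# Before the fix, generated tokens leaked into the cross-prompt key,
# so a second request with the same prompt could never hit the cache.
assert live_cache_key(prompt, generated) != cross_prompt_cache_key(prompt)

# After the fix, an identical new prompt produces an identical key.
assert cross_prompt_cache_key([1, 2, 3]) == cross_prompt_cache_key(prompt)
```

This mirrors the bug described above: mixing generated tokens into the cross-prompt key made every follow-up request look like a cache miss even when its prompt overlapped an earlier one.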

@github-actions

github-actions bot commented Mar 29, 2026

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@dmitryryabkov
Author

I have read the CLA Document and I hereby sign the CLA

@github-actions github-actions bot added the CLA signed Indicates that all contributors have signed label Mar 29, 2026


Development

Successfully merging this pull request may close these issues.

Prompt caching doesn't work for MLX version of GPT-OSS 20B
